67 research outputs found
Recommended from our members
Chapter 2: The Original ToBI System and the Evolution of the ToBI Framework
In this chapter, the authors will try to identify the essential properties of a ToBI framework annotation system by describing the development and design of the original ToBI conventions. In this description, the authors will overview the general phonological theory and the specific theory of Mainstream American English intonation and prosody that the authors decided to incorporate in the original ToBI tags. The authors will also state the practical principles that led us to make the decisions that the authors did. The chapter is organised as follows. Section 2.2 briefly chronicles how the MAE_ToBI system came into being. Section 2.3 briefly describes the consensus account of English intonation and prosody on which the MAE_ToBI system is based. Section 2.4 catalogues the different components of a MAE_ToBI transcription and lists the salient rules which constrain the relationships between different components. This section also expands upon the theoretical foundations and practical consequences of adopting the general structure of multiple labelling tiers, and particularly the separation of the labels for tones from the labels for indexing prosodic boundary strength. Section 2.5 then describes some of the extensions of the basic ToBI tiers that have been adopted by some sites. This section also compares our decisions about the number of tiers and about inter-tier constraints with the analogous decisions for some of the other ToBI systems described in this book. Section 2.6 discusses the status of the symbolic labels relative to the continuous phonetic records that are also an obligatory component of the MAE_ToBI transcription. Section 2.7 then closes by listing several open research questions that the authors would like to see addressed by MAE_ToBI users and the larger ToBI community
Prosodic Effects of Discourse Salience and Association with Focus
Three factors that have been argued to influence the prosody of an utterance are (i) which constituents encode discourse-salient information; (ii) which constituents are contrastive in that they evoke alternatives; and (iii) which constituents interact with the meaning of focus operators such as only (i.e., they ‘associate’ with focus). One challenge for a better understanding of these factors and their interaction has been the difficulty of finding a way to evaluate hypotheses quantitatively, since individual variation in productions is often large enough to wash out experimental effects. In this paper, we apply a methodology introduced in [1] to control for such variation and present evidence for how the three factors interact to influence prosody in sentences containing single or multiple foci
Three steps forward for predictability : Consideration of methodological robustness, indexical and prosodic factors, and replication in the laboratory
There is now abundant evidence that phonetic forms are shaped by probabilistic effects reflecting predictability or informativity. We outline a number of challenges for such work, where theoretical claims are often based on small differences in acoustic measurements, or interpretations of small statistical effect sizes. We outline caveats about the methods and assumptions encountered in many studies of predictability effects, particularly regarding corpus-based approaches. We consider the wide range of factors that influence patterns of variability in phonetic forms, taking a broad perspective on what is meant by “the message” in order to show that predictability effects need to be considered alongside many others, including indexical and prosodic factors. We suggest a number of ways forward to extend our understanding of the form-predictability relationship.Full Tex
Speech Communication
Contains reports on five research projects.C.J. LeBel FellowshipKurzweil Applied IntelligenceNational Institutes of Health (Grant 5 T32 NS07040)National Institutes of Health (Grant 5 R01 NS04332)National Science Foundation (Grant 1ST 80-17599)Systems Development FoundationU.S. Navy - Office of Naval Research (Contract N00014-82-K-0727
Speech Communication
Contains reports on eight research projects.C.J. LeBel FellowshipSystems Development FoundationNational Institutes of Health (Grant 5 T32 NS 07040-08)National Institutes of Health (Grant 5 R01 NS 04332-20)National Science Foundation (Grant 1ST 80-1759)National Science Foundation (Grant 1ST 80-17599 and MCS-8112899)U.S. Navy - Office of Naval Research (Contract N00014-82-K-0727
Speech Communication
Contains reports on eight research projects.C.J. LeBel FellowshipSystems Development FoundationNational Institutes of Health (Grant 5 T32 NS07040)National Institutes of Health (Grant 5 R01 NS04332)National Science Foundation (Grant 1ST 80-17599)U.S. Navy - Office of Naval Research (Contract N00014-82-K-0727
Speech Communication
Contains reports on four research projects.C.J. LeBel FellowshipKurzweil Applied IntelligenceNational Institutes of Health (Grant 5 T32 NS07040)National Institutes of Health (Grant 5 RO1 NS04332)National Science Foundation (Grant BNS84-18733)Systems Development FoundationU.S. Navy - Office of Naval Research (Contract N00014-82-K-0727
Lexical Access Model for Italian -- Modeling human speech processing: identification of words in running speech toward lexical access based on the detection of landmarks and other acoustic cues to features
Modelling the process that a listener actuates in deriving the words intended
by a speaker requires setting a hypothesis on how lexical items are stored in
memory. This work aims at developing a system that imitates humans when
identifying words in running speech and, in this way, provide a framework to
better understand human speech processing. We build a speech recognizer for
Italian based on the principles of Stevens' model of Lexical Access in which
words are stored as hierarchical arrangements of distinctive features (Stevens,
K. N. (2002). "Toward a model for lexical access based on acoustic landmarks
and distinctive features," J. Acoust. Soc. Am., 111(4):1872-1891). Over the
past few decades, the Speech Communication Group at the Massachusetts Institute
of Technology (MIT) developed a speech recognition system for English based on
this approach. Italian will be the first language beyond English to be
explored; the extension to another language provides the opportunity to test
the hypothesis that words are represented in memory as a set of
hierarchically-arranged distinctive features, and reveal which of the
underlying mechanisms may have a language-independent nature. This paper also
introduces a new Lexical Access corpus, the LaMIT database, created and labeled
specifically for this work, that will be provided freely to the speech research
community. Future developments will test the hypothesis that specific acoustic
discontinuities - called landmarks - that serve as cues to features, are
language independent, while other cues may be language-dependent, with powerful
implications for understanding how the human brain recognizes speech.Comment: Submitted to Language and Speech, 202
- …